voice control
LMPVC and Policy Bank: Adaptive voice control for industrial robots with code generating LLMs and reusable Pythonic policies
Modern industry is increasingly moving away from mass manufacturing, towards more specialized and personalized products. As manufacturing tasks become more complex, full automation is not always an option, human involvement may be required. This has increased the need for advanced human robot collaboration (HRC), and with it, improved methods for interaction, such as voice control. Recent advances in natural language processing, driven by artificial intelligence (AI), have the potential to answer this demand. Large language models (LLMs) have rapidly developed very impressive general reasoning capabilities, and many methods of applying this to robotics have been proposed, including through the use of code generation. This paper presents Language Model Program Voice Control (LMPVC), an LLM-based prototype voice control architecture with integrated policy programming and teaching capabilities, built for use with Robot Operating System 2 (ROS2) compatible robots. The architecture builds on prior works using code generation for voice control by implementing an additional programming and teaching system, the Policy Bank. We find this system can compensate for the limitations of the underlying LLM, and allow LMPVC to adapt to different downstream tasks without a slow and costly training process. The architecture and additional results are released on GitHub (https://github.com/ozzyuni/LMPVC).
- Europe > Finland > Pirkanmaa > Tampere (0.05)
- Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.04)
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Garmin Forerunner 970 review: the new benchmark for running watches
Garmin's new top running watch, the Forerunner 970, has very big shoes to fill as it attempts to replace one of the best training and race companions available. Can a built-in torch, a software revamp and voice control really make a difference? The Guardian's journalism is independent. We will earn a commission if you buy something through an affiliate link. The new top-of-the-line Forerunner takes the body of the outgoing Forerunner 965 and squeezes in a much brighter display, useful new running analytics and more of the advanced tech from Garmin's flagship adventure watch the Fenix 8. These upgrades come at a steep cost of 630 ( 750/ 750/A 1,399) – 30 more than its predecessor – placing it right at the top of the running and triathlon watch pile, although less than the 780 Fenix 8.
- Research Report (0.40)
- Overview (0.40)
- Information Technology > Hardware (1.00)
- Information Technology > Communications > Mobile (0.49)
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.37)
Tessan Remote Wall Outlet review: No Wi-Fi, no problem?
The Tessan Remote Wall Outlet is a simple and reliable way to wirelessly control lamps and small appliances. Some will value its independence from the internet; but for a similar price, a smart plug with app and voice control offers far more versatility. The Tessan Remote Wall Outlet is a no-frills way to control lamps, fans, and other household devices without getting off the couch. Instead of relying on Wi-Fi, an app, or a voice assistant, it uses a simple remote to turn a plugged-in light or small appliance on and off. That makes it an easy option for anyone who wants wireless control without dealing with smart home hubs, apps, or the accounts associated with them.
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.58)
- Information Technology > Communications > Networks (0.52)
Best Sonos Speakers (2025): Soundbars, Turntables, and More
After flooding our homes with every Sonos model you can buy (and filling all remaining space with the boxes of said speakers), then using them for a couple of years, we've come to value their audio fidelity and ability to network seamlessly together. There isn't another speaker system that lets you string together multiple speakers as easily or connect them to stream in different rooms of your home while keeping the audio perfectly in sync. The closest thing may be Google Assistant speakers, and Sonos connects to that system as well. Easy streaming: The Sonos app supports almost every streaming service in existence, and many apps, like Spotify, let you stream to Sonos speakers within them. The Sonos ecosystem can also handle home-theater applications and can support a full surround-sound setup.
- Media > Music (0.88)
- Leisure & Entertainment (0.88)
The best streaming devices for 2025
Nearly every TV on the market today is a smart TV, but not every operating system is a winner. A media streaming device lets you pair whichever user interface you prefer with just about any screen that has an HDMI port. In some cases, such as with older or less expensive smart TVs, a streaming stick or dongle could even be speedier and less glitchy than your TV's built-in system. At home, these handy gadgets make it easier for cord cutters to watch the millions of hours of content streaming services provide without cable. And while traveling, a streaming player lets you watch your preferred content on hotel sets (without painstakingly typing in a bunch of passwords or activation codes). We tested out streaming players from Roku, Google, Apple, Amazon and more, gauging the usability and the performance of each to come up with our list of the best streaming devices you can buy. Google's TV Streamer, the Apple TV 4K, Amazon's Fire TV Sticks and Roku devices are the most popular players in the space.
- Information Technology (1.00)
- Media > Television (0.90)
- Leisure & Entertainment > Games > Computer Games (0.70)
- Information Technology > Artificial Intelligence (1.00)
- Information Technology > Hardware (0.68)
- Information Technology > Communications > Mobile (0.49)
- Information Technology > Human Computer Interaction > Interfaces (0.49)
Stop talking to your phone: How to use Type to Siri
Among the changes ushered in with iOS 18.1, iPadOS 18.1, and macOS 15.1 Sequoia is a new Type to Siri option. This means you can carry on a conversation with Apple's digital assistant without having to talk out loud, which is helpful when you're in a quiet library, busy subway car, or anywhere else you can't really use voice control. The ability to type to Siri has actually been available on Apple devices for several years now, but previously it was hidden away in the Accessibility settings and not all that easy to find. Now Apple has given it much more prominence in its operating systems, so typing is just as straightforward as talking. Breakthroughs, discoveries, and DIY tips sent every weekday.
Unispeaker: A Unified Approach for Multimodality-driven Speaker Generation
Sheng, Zhengyan, Du, Zhihao, Lu, Heng, Zhang, Shiliang, Ling, Zhen-Hua
Recent advancements in personalized speech generation have brought synthetic speech increasingly close to the realism of target speakers' recordings, yet multimodal speaker generation remains on the rise. This paper introduces UniSpeaker, a unified approach for multimodality-driven speaker generation. Specifically, we propose a unified voice aggregator based on KV-Former, applying soft contrastive loss to map diverse voice description modalities into a shared voice space, ensuring that the generated voice aligns more closely with the input descriptions. To evaluate multimodality-driven voice control, we build the first multimodality-based voice control (MVC) benchmark, focusing on voice suitability, voice diversity, and speech quality. UniSpeaker is evaluated across five tasks using the MVC benchmark, and the experimental results demonstrate that UniSpeaker outperforms previous modality-specific models. Speech samples are available at \url{https://UniSpeaker.github.io}.
- Europe > Austria > Vienna (0.14)
- Asia > South Korea > Seoul > Seoul (0.05)
- Asia > China (0.04)
- (8 more...)
The 15 Best Black Friday Deals From Best Buy (2024)
Black Friday is upon us once again, and that means great deals on all the gear you've been eyeing that seemed just a bit too pricey. Below we've rounded up our favorite Black Friday Best Buy deals, bringing the best of the store into your living room. Now is the time to strike, so whether you're after a sweet new screen, a smarter security camera, or any number of cool gadgets with a temptingly slashed price tag, you'll find the perfect holiday shopping fare below. Get best-in-class reporting that's too important to ignore for just 2.50 1 per month for 1 year. Includes unlimited digital access and exclusive subscriber-only content.
- Information Technology > Hardware (0.49)
- Information Technology > Artificial Intelligence (0.49)
- Information Technology > Communications > Mobile (0.48)
Avoiding Siri slipups and apologies for butt dials
Voice assistants may cause confusion across devices. Tech expert Kurt Knutsson offers some solutions to fix it. When it comes to using voice assistants across multiple devices, things can get a bit tricky. "Mike" from St. George, Utah, found himself in a comical yet frustrating situation with his personal and work iPhones. Let's dive into his predicament and explore some solutions.
- Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.70)
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.67)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.59)
- Information Technology > Communications > Mobile (0.50)
DJI Neo review: The best 200 drone ever made
When DJI revealed its tiny 200 Neo drone, I immediately saw how it could fit into my vlogger's toolkit to supplement my Mini 4 Pro and Mavic 3 Pro. Flying those sophisticated drones is a whole thing that requires planning. But the Neo can be launched spontaneously to grab quick and fun shots, thanks to features like palm takeoff and voice control. That ease of use also makes it ideal for the social media influencers. You get features from DJI's bigger drones like ActiveTrack, FPV capabilities and even support for DJI's Mic 2. And forget about the fuzzy video you may have seen on other cheap drones. The Neo can record in sharp 4K, making it suitable for content creators who need affordable aerial video.
- North America > United States (0.04)
- Europe (0.04)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
- Information Technology > Communications > Social Media (0.92)